The SPHINX-II speech recognition system: an overview

نویسندگان

  • Xuedong Huang
  • Fil Alleva
  • Hsiao-Wuen Hon
  • Mei-Yuh Hwang
  • Kai-Fu Lee
  • Ronald Rosenfeld
چکیده

In order for speech recognizers to deal with increased task perplexity, speaker variation, and environment variation, improved speech recognition is critical. Steady progress has been made along these three dimensions at Carnegie Mellon. In this paper, we review the SPHINX-II speech recognition system and summarize our recent efforts on improved speech recognition. This research was sponsored by the Defense Advanced Research Projects Agency and monitored by the Space and Naval Warfare Systems Command under Contract N00039-91-C-0158, ARPA Order No. 7239. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the U.S. Government.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Overview of the SPHINX-II Speech Recognition System

In the past year at Carnegie Mellon steady progress has been made in the area of acoustic and language modeling. The result has been a dramatic reduction in speech recognition errors in the SPHINX-II system. In this paper, we review SPHINX-I/and summarize our recent efforts on improved speech recognition. Recently SPHINX-I/ achieved the lowest error rate in the November 1992 DARPA evaluations. ...

متن کامل

Understanding the CMU Sphinx Speech Recognition System

The Sphinx-II is a speech recognition engine developed by CMU .It can be used to build both small, medium or large vocabulary applications . This project focused on finding out how an speech recognition engine can be implemented using HMM by referring to one of the Sphinx developer’s thesis .We dissected the source code of Sphinx-II , find out each component of an speech recognition system , th...

متن کامل

Incorporating Lr Parsing into Sphinx

This paper describes the integration of an LR natural language parser with the SPHINX speech recognition system. SPHINX is one of the most successful speech recognition systems in use today. Although it attains high word accuracy, SPHINX often outputs ungrammatical recognition results because the baseline SPHINX system uses very simple word-pair or bigram language models. For applications of sp...

متن کامل

Applying SPHINX-II to the DARPA Wall Street Journal CSR Task

This paper reports recent efforts to apply the speaker-independent SPHINX-H system to the DARPA Wall Street Journal continuous speech recognition task. In SPHINX-H, we incorporated additional dynamic and speaker-normalized features, replaced discrete models with sex-dependent semi-continuous hidden Markov models, augmented within-word triphones with between-word triphones, and extended generali...

متن کامل

A Sphinx Based Speech-music Segmentation Front-end for Improving the Performance of an Automatic Speech Recognition System in Turkish

In this study a system that segments an audio signal as speech and music by using posterior probability based features is proposed and implemented in Sphinx. Unlike the earlier efforts that uses Multi-Layer Perceptrons (MLP), this system uses Hidden-MarkovModel based acoustic models that are trained in Sphinx for posterior probability calculations. Acoustic Models are trained with the HMM-state...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computer Speech & Language

دوره 7  شماره 

صفحات  -

تاریخ انتشار 1993